Sensitivity of Leverage Scores and Coherence for Randomized Matrix Algorithms

نویسندگان

  • ILSE C. F. IPSEN
  • THOMAS WENTWORTH
چکیده

Leverage Scores. Statistical leverage scores were introduced in 1978 by Hoaglin and Welsch [8] to detect outliers when computing regression diagnostics, see also [3, 12]. To be specific, consider the least squares problem minx ‖Ax− b‖2, where A is a real m× n matrix with rank(A) = n. The so-called hat matrix H ≡ A(AA)A is the orthogonal projector onto range(A), and determines the fit b̂ ≡ Hb. The diagonal elements of the hat matrix are called leverage scores of A,

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast approximation of matrix coherence and statistical leverage

The statistical leverage scores of a matrix A are the squared row-norms of the matrix containing its (top) left singular vectors and the coherence is the largest leverage score. These quantities have been of interest in recently-popular problems such as matrix completion and Nyström-based low-rank matrix approximation; in large-scale statistical data analysis applications more generally; and si...

متن کامل

Sensitivity of Leverage Scores

The sampling strategies in many randomized matrix algorithms are, either explicitly or implicitly, controlled by statistical quantities called leverage scores. We present four bounds for the sensitivity of leverage scores as well as an upper bound for the principal angles between two matrices. These bounds are expressed by considering two real m×n matrices of full column rank, A and B. Our boun...

متن کامل

Ridge Regression and Provable Deterministic Ridge Leverage Score Sampling

Ridge leverage scores provide a balance between low-rank approximation and regularization, and are ubiquitous in randomized linear algebra and machine learning. Deterministic algorithms are also of interest in the moderately big data regime, because deterministic algorithms provide interpretability to the practitioner by having no failure probability and always returning the same results. We pr...

متن کامل

Input Sparsity Time Low-rank Approximation via Ridge Leverage Score Sampling

Often used as importance sampling probabilities, leverage scores have become indispensable in randomized algorithms for linear algebra, optimization, graph theory, and machine learning. A major body of work seeks to adapt these scores to low-rank approximation problems. However, existing “low-rank leverage scores” can be difficult to compute, often work for just a single application, and are se...

متن کامل

ALEVS: Active Learning by Statistical Leverage Sampling

Active learning aims to obtain a classifier of high accuracy by using fewer label requests in comparison to passive learning by selecting effective queries. Many active learning methods have been developed in the past two decades, which sample queries based on informativeness or representativeness of unlabeled data points. In this work, we explore a novel querying criterion based on statistical...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013